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antenatal d M syn prenatal d. 

clinical (L, a d. made from a study of the signs and symptoms of a 
disease. 

differential d,, the determination of which of two or more dis- 
eases with similar symptoms is the one from which the patient is 
suffering, by a systematic comparison and contrasting of the clini- 
cal findings, syn differentiation (2). 

d. by exclusion, a d. made by excluding those diseases to which 
only some of the patient's symptoms might belong, leaving one 
disease as the most likely d., although no definitive tests or 
findings establish that d. 

laboratory d., a d. made by a chemical, microscopic, microbio- 
logic, immunologic, or pathologic study of secretions, discharges, 
blood, or tissue. 

neonatal d., systematic evaluation of the newborn for evidence of 
disease or malformations, and the conclusion reached, 
pathologic d., a <L, sometimes postmortem, made from an ana- 
tomic and/or histologic study of the lesions present 
physical d., (1) a d. made by means of physical examination of 
the patient (2) the process of a physical examination, 
prenatal d., d. utilizing procedures available for the recognition 
of diseases and malformations in utero, and the conclusion 
reached, syn antenatal d. 

di ag nos tic (di-ag-nos'tik). 1. Relating to or aiding in diagnosis. 
2. Establishing or confirming a diagnosis. 

di ag nos ti cian (dfag-nos-tish'an). One who is skilled in mak- 
ing diagnoses; formerly, a name for specialists in internal medi- 
cine. 



Diagnostic and Statistical Manual of Mental Disorders 
(DSM). A system of classification, published by the American 
Psychiatric Association, that divides recognized mental disorders 
into clearly defined categories based on sets of objective criteria. 
Representing a majority view (rather than a consensus) of hun- 
dreds of contributors and consultants, DSM is widely recognized 
as a diagnostic standard and widely used for reporting, coding, 
and statistical purposes. 

The first edition (1952), based on the sixth revision of the 
International Classification of Diseases (/CD-6), was in- 
tended to promote uniformity in the naming and reporting 
of psychiatric disorders. It contained definitions: of all 
named disorders, but no sets of diagnostic criteria. While 
its classification of mental disorders showed the influence 
of Freudian psychoanalysis, its nomenclature (e.g., depres- 
sive reaction, anxiety reaction, schizophrenic reaction) re- 
flected the theories of Adolf Meyer (1866-1950). The 
second edition (DSM-JI, 1968) preserved the psychoana- 
lytic orientation but dropped the "reaction" terminology. 
The. third edition (DSM-III. 1980) abandoned much of the 
rigidly psychodynamic thinking of the earlier editions and, 
for the first time, provided explicit diagnostic criteria and 
introduced a multiaxial system whereby different aspects 
of a patient's condition could be separately assessed. 
Briefly stated, the axes are I, clinical disorders; H, person- 
ality disorders and mental retardation; IH, general medical 
disorders; IV, psychosocial and environmental stressors; 
and V, overall level of functioning. A revised version of 
the third edition (DSM-IIIR, 1987) incorporated a number 
of improvements and clarifications. The fourth edition 
{DSM-IV) appeared in May, 3994. It follows its two prede- 
cessors closely in general outline, and like them is coordi- 
nated with and partly derived from /CD- 9. For many ob- 
servers, the most significant change in DSM-IV is the 
renaming of the category formerly called "Organic Mental 
Syndromes and Disorders" as "Delirium, Dementia, and 
Amnestic and Other Cognitive Disorders," a shift in termi- 
nology intended to avoid the implication that mental disor- 
ders in other categories are not organic. 

di a gram. A simple, graphic depiction of an idea or object, 
Dieuaide d., syn triaxial reference system. 



flow (L, a d. composed of blocks connected by arrows represent! 
ing steps in a process such as decision analysis. 
□ Verm d., pictorial representation of the extent to which two ; ^l 
more quantities or concepts are mutually inclusive and exclusive,! 
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di a ki ne-sis (df &-ki-n5'sis). Final stage of prophase in meiosis ijHf 
in which the chiasmata present during the diplotene stage disap- 
pear, the chromosomes continue to shorten, and the nucleolus and;! 
nuclear membrane disappear. [G. dia, through, + kinesis, movi^ 
ment] 

dial (dfal, dll). A clock face or instrument resembling a clocfc$I 
face. [L. dies, day] ■.• 
astigmatic d., a diagram of radiating lines, used to test for asu#;|f 
matism. 

Di-a-lis-ter (aT-31-is'ter). An obsolete name for a genus of bacte-^ 
ria, the type species of which, D. pneumosintes, is now placed h)ff| 
the genus Bacteroides. . - • %| 

diallyl (dl-aTil). A compound containing two allyl groups. gf 

di al y sance (di-aTi-sans). The number of milliliters of bloodj 
completely cleared of any substance by an artificial kidney or by$ 
peritoneal dialysis in a unit of time; conventional clearance fort! 
mulas are expressed as mm/min. [tr. dialysis] ; ^:§f 

di al y sate (dT-al'i-sat). That part of a mixture that passes 1 
through a dialyzing membrane; the material that does not pas£|| 
through is referred to as the retentate, syn diffusate. :.';M 

di al y sis (dl-aTi-sis). 1. A form of filtration to separate crystal- 
loid from colloid substances (or smaller molecules from largccp 
ones) in a solution by interposing a semipermeable merabra^|g| : 
between the solution and dialyzing fluid; the crystalloid (smalfcr)|» 1 
substances pass through the membrane into the dialyzing fluid'pajg 
the other side, the colloids do not. 2. The separation of substance*! 
across a semipermeable membrane on the basis of particle su*i^ 
and/or concentration gradients. 3. A method of artificial Kidney^ 
function. [G. a separation, ft. dialyo, to separate] 
continuous ambulatory peritoneal d. (CAPD), method of 
toneal d. performed in ambulatory patients with influx and efflMJF^ 
of dialysate during normal activities. . 
equilibrium d., in immunology, a method for determioationy^M j 
association constants for hapten- antibody reactions in a system$|fi 
which the hapten (dialyzable) and antibody (nondialyzable) solu- 
tions are separated by semipermeable membranes. Since at cqupj^f 
librium the quantity of free hapten will be the same in the 
compartments, quantitative ^terminations can be made of baj% 
ten-bound antibody, free antibody, and free hapten. 
extracorporeal d., hemodialysis performed through an apparatus, 
outside the body. 

peritoneal d., removal from the body of soluble substances aoS 




therapeutic i., the ratio of LD50 to ED^ used in quantitative 
comparison of drugs. 

thoracic i., anteroposterior diameter of the thorax times 100 di- 
vided by the transverse diameter of the thorax, syn chest i, 
tibiofemoral i. t the ratio obtained by multiplying the length of the 
tibia by 100 and dividing by the length of the femur, 
transversovertical 1., syn vertical i. 

tuberculoopsonic i., the opsonic i. calculated in relation to tuber- 
culous infection, with an actively growing culture of Mycobacte- 
rium tuberculosis or the strain of tubercle bacillus from the pa- 
tient being used in the test. 

ultraviolet U a daily i. issued by the U.S. National Weather 
Service for many cities, forecasting the amount of dangerous 
ultraviolet light that will arrive at the earth's surface about noon 
the following day. 

uricolytic i., the percentage of uric acid oxidized to allantoin 
before being secreted. 

vertical the relation of the height to the length of the skull: 
(height x lOOyiength. syn height-length i., length-height i., trans- 
versovertical i. 

vital i., the ratio of births to deaths within a population during a 
given time. 

Volpe-Manhold i. (V-MI), an index for comparing the amount of 
dental calculus in individuals. 

volume \. y an indication of the relative size (e.g., volume) of 
erythrocytes, calculated as follows: hematocrit value, expressed as 
per cent of normal +- red blood cell count, expressed as per cent of 
normal = volume i. 

zygomaticoauricular I., the ratio between the zygomatic and the 
auricular diameters of the skull or head. 



in di can (in'di-kan). 1. Indoxyl P-D-glucoside from Indigofera 
species and Polygonium tinctorium; a source of indigo, syn plant 
i. 2. 3-Indoxylsulfuric acid, a substance found (as its salts) in 
sweat and in variable amounts in urine; indicative, when in quan- 
tity, of protein putrefaction in the intestine (indicanuria). syn 
metabolic i., uroxanthin. 
metabolic i., syn indican (2). 
plant i., syn indican (1). 

in di can i dro sis (in'di-kan-i-dro'sis). Excretion of indican in 
the sweat, [indican + G. hidros, sweat] 

in di cant (in'di-kant). 1. Pointing out; indicating. 2. An indica- 
tion; especially a symptom indicating the proper line of treatment 
[L. in-dico, pres. p, -arts (-ant), to point out] 

in di can u ria (in'di-kan-u're-&). An increased urinary excretion 
of indican, a derivative of indol formed chiefly in the intestine 
when protein is putrefied; indol is also formed during the putre- 
faction of protein in other sites. 

in-di-ca tion (in-di-kS'shfln). The basis for initiation of a treat- 
ment for a disease or of a diagnostic test; may be furnished by a 
knowledge of the cause (causal i.), by the symptoms present 
(symptomatic L), or by the nature of the disease (specific I.). [L. 
fr. in-dico, pp. -atus, to point out, fr. dico, to proclaim] 
off label f., use of a medication for a purpose Other than that 
approved by the FDA. 

in di ca tor (in'di-ka-ter, -tor). 1. In chemical analysis, a sub- 
stance that changes color within a certain definite range of pH or 
oxidation potential, or in any way. renders visible the completion 
of a chemical reaction; e.g., litmus, phenolsulrbnphthalein. 2. An 
isotope that is used as a tracer. 3. The labeled substance whose 
distribution between reactants of a system is used to determine the 
amount of analyte present [L. one that points out] 
alizarin t, a solution consisting of 1 g sodium alizarin sulfonate 
dissolved in 100 mL distilled water, used as an i. for free acidity 
in gastric contents. 

clinical 1., a measure, process, or outcome used to judge a particu- 
lar clinical situation and indicate whether the care delivered was 
appropriate. 

health i., variable, susceptible to direct measurement, that reflects 
the state of health of persons in a community. 



oxidation-reduction i-, a substance that undergoes a definite coj^ 
or change at a specific oxidation potential, syn redox i. ■J:. f& , 
redox L, syn oxidation-reduction i, 

in di ces (in'di-sez). Alternative plural of index. 

In did la (in-de-el'5). Old name for Madurella. 

in dig e* nous (in-dij'S-nus). Native; natural to the country 0r 
region where found. [L. indigenus, born in fr. indu, within (otf.-S 
form of in), + G. -gen, producing] :.'(|| 

in di ges tion (in-di-jes'chiln). Nonspecific term for a variety of)^ 
symptoms resulting from a failure of proper digestion and absorp. 'M 
tion of food in the alimentary tract. : , 
acid i„ i. resulting from hyperchlorhydria; often used by the laity 
as a synonym for pyrosis, 
fat i., syn steatorrhea, 
gastric L, syn dyspepsia, 
nervous i. caused by emotional upsets or stress. 

in di go (in'dJ-go) [C.I. 73000]. A blue dyestuff obtained from ■$ 
Indigofera tinctoria, and other species of Indigofera (family 
teguminosae); also made synthetically, syn indigo blue, indigo. 
tin. (L. indicum, fr. G. indikon, indigo, ntr. of Indikos, Indian] 

in di go bine, syn indigo. '•. j|| 

in di go carmine [C.l. 73015]. A blue dye used for 

measure- $ 

ment of kidney function and as a special stain for Negri bodies, 
syn sodium indigotin disulfonate. 

in dig o tin (in-dig'o-tin, in-di-g5'tin). syn indigo. 

in di go u ria, in di gu ria (in'dJ-go-u're-a\ in-di-goo're-a), The 
excretion of indigo in the urine. ■ 

in dis po si tion (in-dis-po-zish'un). Illness, usually slight; mak 
aise. [L. in neg. + dispositio, an arrangement, fr. dis-pono, pp;^l 
-positus, to place apart] 

in di um (In) (in'de-om). A metallic element, atomic no. 49^^ 
atomic wt 114.82. [indigo, because of its blue line in thespecsH 
trum] \ 

in di um-111 ( 111 In). A cyclotron-produced radionuclide with'a>i| 
half-life of 2.8049 days and with gamma ray emissions of 17 1.2; 
and 245.3 kiloelectron volts. In a chloride form, it is used as a:f 
bone marrow and tumor-localizing tracer; in a chelate form, as a 
cerebrospinal fluid tracer. It is also used as a white blood cell ...j 
labeling agent and as an antibody label. .yYraj 
I. chloride, i. trichloride, Cl 3 In; used in electron microscopy .to:" 
stain nucleic acids in thin tissue sections. 

in di um-113m ( 113n, In). A radioactive isomer of I13 In; it has a -J| 
half-life of 1.658 hours; it has been used in cisternography and as " 
a diagnostic aid in cardiac output. 

in di vid u a tion (in'di-vid-u-a'shi3n). 1. Development of faQi 
individual from the specific. 2. In jungian psychology, the process;|| 
by which one's personality is differentiated, developed, and ei; g 
pressed- 3. Regional activity in an embryo as a response to ao:g 
organizer. 

in do cy a nine green (in-do-sf &-nen). A tricarbocyanine djfcv| 
that binds to serum albumin and is used in blood volume determijjt| 
nations and in liver function tests. 

in do cy bin (in-d6-sfbin). syn psilocybin. 

In dol ac e tu ria (in'dol-as-e-too're-5). Excretion of an appred; . 
able amount of indoleacetic acid in the urine; a manifestation «g| 
Haitnup disease, also seen in patients with carcinoid tumors, 

in dol a mine (in-dol'a-men). General term for an indole or"* 
dole derivative containing a primary, secondary, or tertiary annpf 
group (e.g., serotonin). ' : 'fW$ 

in dole (in'dol). 1. 2,3-Benzopyrrole; basis of many bmlog^|§ 
active substances (e.g., serotonin, tryptophan); formed in de #j^S| 
tion of tryptophan, syn ketole. 2. Any of many alkaloids coDW^gg 
ing the i. (1) structure. . ; S$f 

in do lent (in'do-lent). Inactive; sluggish; painless or ne ^^§l 
said of a morbid process. [L. in- neg. + doleo, pr. p. dolens ("^gpg 
to feel pain] 'J$$$k 

in dol ic acids (in-dol'ik). Metabolites of L-tryptophan 
within the body or by intestinal microorganisms; the P" 11 , 01 ^^^ 
encountered in urine are indoleacetic acid, mdoleacetylgl u,aIff 
5-hydroxyindoleacetic acid, and indolelactic acid. 
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difficult . for patients to make informed choices about their care. Pa- 
tients should be told who is providing care, what benefits and burdens 
can be attributed to trainees, and how trainees are supervised; Most 
patients, when informed, allow trainees to play an active role in their 
care. : .- ! ■ :.: 

IMPAIRED PHYSICIANS Physicians may hesitate to inter- 
vene when colleagues impaired by alcohol abuse, drug abuse, or psy- 
chiatric or medical illness place patients at risk. However, society re- 
lies on physicians to regulate themselves. If colleagues of an impaired 
physician do not take steps to protect patients, no one else may be in 
a position to do so. • 

CONFLICTS FOR TRAINEES Medical students and resi- 
dents may fear that they will receive poor grades or evaluations if they 
act on the patient's behalf by disclosing mistakes, avoiding misrep- 
resentation of theirrole; and reporting impaired colleagues. Discussing 
such dilemmas with more senior physicians can help trainees check 
their interpretation of the situation and obtain advice and assistance. 

ADDITIONAL ETHICAL ISSUES • a 

MAINTAINING CONFIDENTIALITY Maintaining the con- 
fidentiality of medical mformaUon respects patients* autonomy and 
privacy, encourages them to. seek treatment and to discuss their prob- 
lems candidly, and prevents discrimination. Physicians need to guard 
against inadvertent breaches of confidentiality, as when talking about 
patients in elevators. Maintaining confidentiality is not an absolute 
rule. The law may require physicians to override, confidentiality in 
order to protect third parties, for example, reporting to government 
officials persons with specified infectious conditions, such as tuber- 
culosis and syphilis; persons, with gunshot wounds; and victims of 
elder : abuse and dpmesde. violence. Computerized , medical records 
raise additional concerns because breaches of confidentiality may. af- 
fect many patients, 

ALLOCATING RESOURCES JUSTLY Allocation of lim- 
ited health care resources is ( problematic. IdeaUy, allocation decisions 
should be made as public policy, with physician input. At the bedside, 
physicians generally should act as patient advocates within constraints 
set by society, reasonable insurance; coverage*, and sound practice. Ad 
hoc rationing by me individual' physician at the bedside may be in- 
consistent, discriminatory, and ineffective. In some cases,, however, 
two patients may compete, for the same limited resources, such as 
physician time or a bed in intensive care. When this, occurs, physicians 
should ration their time and resources according to patients' medical 
needs and the probability of benefit. .. , . ^ 

ASSISTANCE WITH ETHICAL ISSUES Discussing per- 
plexing ethical issues with other members of the health care team, 
colleagues, or the .hpspital ethics committee often clarifies issues and 
suggests ways to improy e^ornmuru^ation and to deal with strong emo- 
tions. When struggling with difficult ethical issues* physicians, may 
need, to reevaluate their -basic convictions, tolerate uncertainty, and 
maintain their integrity while respecting the opinions of others. 
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To the medical student who requires 2 h to collect a patient's history 
and perform a physical examination, and several additional hours to 
organize them into a coherent presentation, the experienced clinician's 
ability to reach a diagnosis and decide on a management plan in a 
fraction of the time seems extraordinary. While medical knowledge 
and experience play a significant role in the senior clinician's ability 
to arrive at a differential diagnosis and plan, quickly, -much of the 
process involves skill in clinical decision-making. The first goal of this 
chapter is to provide an introduction to the study of clinical reasoning. 

Equally bewildering to the student are the proper use of diagnostic 
tests and the integration of the results , into the clinical assessment. The 
novice medical practitioner typically uses a "shotgun" approach to 
testing, hoping to a hit a target without knowing exactly what that 
target is. The expert, on the other hand, usually has a specific target 
in mind and effidehtly . adjusts, the testing strategy to it. The; second 
goal of this chapter is to review briefly some of the crucial basic sta- 
tistical concepts that govern .the proper interpretation and use of di- 
agnostic tests; quantitative tools available to assist in clinical decision- 
making will also be discussedJ i\ 

CLINICAL DECISION-MAKING 

. CLINICAL REASONING The most important clinical actions 
are not procedures or prescriptions but the judgments from which all 
other aspects of clinical medicine flow. In the modem .era of large 
randomized trials, it is easy to overlook me importance of this elusive 
mental activity and focus instead on the algorithmic practice guidelines 
constructed to improve care. One, reason for this apparent neglect is 
that much more research has been done on how doctors should make 
decisions (e.g. , using a Bayesian model discussed below) than on how 
they actually do. Thus, much of what we know about clinical reasoning 
comes from empirical studies, of nonmedical problem-solving be- 
havior. ■ 

Despite the great technological advances of the twentieth century, 
uncertainty still plays a pi votal role in all aspects of medical decision- 
making. We may know that a patient does not have long to live, but 
we cannot be certain how long. We may prescribe, a potent new re- 
ceptor blocker to reverse the course of a patient's illness, but we cannot 
be certain that .the therapy will do so without side effects. Uncertainty 
in medical outcomes creates the need for probabilities and other math- 
ematical/statistical tools to help guide decision-making. (These tools 
are reviewed later in the chapter.) 

Uncertainty is compounded by the information overload that char- 
acterizes modern medicine. Today's experienced clinician needs close 
to 2 million pieces of information to practice medicine. Doctors sub- 
scribe to an average of 7 journals, representing over 2500 new articles 
each year. Computers offer the obvious solution both for management 
of information and for better quantitation and management of the daily 
uncertainties of medical care. While the technology, to computerize 
medical practice is available, many practical problems remain to be 
solved before patient information can be standardized and integrated 
with medical evidence on a single electronic platform. . 
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reduces the likelihood of hyperthyroidism in a patient with paroxysmal 
atrial fibrillation. 

While the representativeness and availability heuristics may play 
the major roles in shaping early diagnostic hypotheses, the acuity of 
a patient's illness can also be very influential For example, clinicians 
are taught to consider aortic dissection routinely as a possible cause 
of acute severe chest discomfort along with myocardial infarction, 
even though the typical history of dissection is different' from, myo- 
cardial infarction and dissection is far less prevalent (Chap. 247). This 
recommendation is based on the recognition that a relatively rare but 
catastrophic diagnosis like aortic dissection is very difficult, to. make 
unless it is explicitly considered; If the clinician fails to elicit any of 
the characteristic features of dissection by history and finds equivalent 
blood pressures in both arms and no pulse deficits, he or she. may feel 
comfortable in discarding.the aortic dissection hypothesis. If, however, 
the chest x-ray shows a widened mediastinum, the hypothesis may be 
reinstated and a diagnostic test ordered [e.g., thoracic computed to- 
mography (CT) scan, transesophageal echocardiogram) to evaluate it 
more fully. In noncritical situations, the prevalence of potential alter- 
native diagnoses should play a much more prominent role in diagnostic 
hypothesis generation. The value of conducting a rapid systematic clin- 
ical survey of symptoms and organ systems to avoid.missing important 
but inapparent clues cannot be overstated. 

Because the generation and evaluation of appropriate diagnostic 
hypotheses is a skill that not all clinicians possess to an equal degree, 
errors in this process can occur, and in the patient with, serious acute 
illness these may lead to tragic consequences. Consider the following 
hypothetical example, A 45-year-old male patient with a 3-weel^ his- 
tory of a "flulike'* upper respiratory infection (URI) presented to his 
physician with symptoms of dyspnea and a productive, cough. Based 
on the presenting complaint, the clinician pulled, put a "URI Assess- 
ment Form" to improve quality and. efficiency of carp. The physician 
quickly completed the examination components outlined on this struc- 
tured form, noting in particular the absence of fever and a clear chest 
examination. He then prescribed an antibiotic for presumed bronchitis, 
showed the patient how to breathe into a paper bag to relieve rus 
"hyperventilation," and sent him home with me reassurance that his 
illness was not serious. After a sleepless night with significant dyspnea 
unrelieved by rebreathing into a bag, the patient developed nausea and 
vomiting and collapsed. He was brought into the Emergency Depart- 
ment in cardiac arrest and could riot be resuscitated. Autopsy showed 
a posterior wall myocardial infarction and a fresh thrombus in ah 
atherosclerotic right coronary artery What went wrong? The clinician 
decided, even before starting the history, that the patient's complaints 
were not serious. He therefore felt confident that he could perform an 
abbreviated and focused examination using the URI assessment pro- 
tocol rather than considering the full range of possibilities and per- 
forming appropriate tests to Confirm or refute his initial hypotheses: 
In particular, by concentrating on the "URI," the clinician failed to 
elicit the full dyspnea history, which would have suggested a far more 
serious disorder, and did hot even search for other symptoms mat could 
have directed him to the correct diagnosis 

This example illustrates how patients can diverge from textbook 
symptoms and the potential consequences of being unable to adapt the 
diagnostic process to real- world challenges. The expert, while recog- 
nizing that common things occur commonly; approaches bach evalu- 
ation on high alert for clues that the initial diagnosis may be wrong. 
Patients often provide information that "does- not fit" with any of the 
leading diagnostic hypotheses being considered: Distrnguishing : real 
clues from false trails can only be achieved by practice and experience. 
A less experienced clinician who tries to be too efficient (as in the 
above example) can make serious judgment errors: - 

MAJOR INFLUENCES ON CLINICAL DECISION- 
MAKING More than a decade of research on variations in clinician 
practice patterns has shed much light on forces that shape clinical 



decisions. The use of heuristic "shortcuts*" as detailed above, provides 
a partial explanation, but several other key factors play an important 
role in shaping diagnostic hypotheses, and management decisions. 
These factors can be grouped conceptually into three overlapping cat- 
egories: (1) factors related to physician personal characteristics and 
practice style, (2) factors related to the practice setting, and (3) eco- 
nomic incentive factors. 

Practice Style Factors pne'of the key roles of the physician in 
medical care is to serve as the patient's agent to ensure that necessary 
care is provided at a high level of quality. Factors that influence this 
role include the physician's knowledge, training, and experience. It is 
obvious that physicians cannot practice evidence-based medicine if 
they are unfamiliar with the evidence. As would be expected, special- 
ists generally Imow, the evidence in their field better than do general- 
ists. Surgeons may be more enthusiastic about recommending surgery 
than medical doctors because their belief in the beneficial effects of 
surgery is stronger, For the same reason, invasive cardiologists are 
much more likely to refer chest pain patients for diagnostic catheter- 
ization than are noninvasive cardiologists or generalise. The physician 
beliefs that drive these different, practice, styles are based on personal 
experience, recollection, and interpretation of the available medical 
evidence. For example, heart failure specialists are much more likely 
than generalists to achieve target angiotensin-converting enzyme 
(ACE) inhibitor therapy in their heart failure patients because they are 
more familiar with what the targets are (as defined by large clinical 
trials), have more familiarity with the specific drugs (including dosages 
and side effects), and are less likely to overreact to foreseeable prob- 
lems in therapy such as a rise in creatinine levels or symptomatic 
hypotension. Other intriguing research has shown a wide distribution 
of acceptance times of antibiotic therapy for peptic ulcer disease fol- 
lowing widespread dissemination of the"evidence ? * in the medical 
literature. Some gastroenterologists accepted this new therapy before 
the evidence was clear (reflecting, perhaps, an aggressive practice 
style), and some gastroenterologists lagged behind (a conservative 
practice style, associated in this case with, older physicians).' As a 
group, internists lagged several years behind gastroenterologists. 

The opinion of influential leaders can also have an important effect 
on practice patterns: Such influence can occur at both the national level 
(e.g., expert physicians teaching at national meetings) and the local 
level (e.g., local educational programs, "curbside consultants"). Opin- 
ion leaders do not have, to be physicians. When conducting rounds 
with clinical pharmacists, physicians are less likely to make medication 
errors and more likely to use target levels of evidence-based therapies. 

The patient's welfare is not the only concern that drives clinical 
decisions. The physician's perception about the risk of a malpractice 
suit resulting from either an erroneous decision or a bad outcome cre- 
ates a style of practice referred to as defensive medicine. This practice 
involves using tests and therapies with very small marginal returns to 
preclude future criticism in the event of an adverse outcome. For ex- 
ample, a 40-year-old woman who presents with a long-standing history 
of intermittent headache and a new. severe headache along with a nor- 
mal neurologic examination has a very low likelihood of structural 
intracranial pathology. Performance of a head GT or magnetic reso- 
nance imaging (MRI) scan in this situation would constitute defensive 
medicine. On the other hand, the results of the test could provide 
reassurance to an anxious patient. ■• 

Practice Setting Factors Factors in this category relate to the 
physical resources available to the physician's practice and the practice 
environment. Physician-induced demand is a term that refers to the 
repeated observation that physicians have a remarkable ability to ac- 
commodate to and employ the medical facilities available to them. A 
classic early study in this area showed that physicians in Boston had 
an almost 50% higher hospital admission rate than did physicians in 
New Haven, despite there being no obvious differences in the health 
of the cities- 1 inhabitants. The physicians in New Haven were not aware 
of using fewer hospital beds for their patients, nor were the Boston 
physicians aware of using less stringent criteria to admit patients. 
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Table 3-2 Measures of Disease Probability ;: 



Pretest probability of disease = probability of disease before test is done; 
may use population prevalence of disease or more patient-specific data to 
generate this probability estimate. 

Posttest probability of disease = probability of disease accounting for both 
pretest probability and test results; also called predictive value of the test. 



Baycs* theorem 
Computational version: 

Posttest probability = 



Pretest probability X test sensitivity ..... 

Pretest probability X test sensitivity + 
(1 - disease prevalence) X' test false-positive rate 



Example [with a pretest probability of 0.50 and a "positive" diagnostictest 
result (test sensitivity =» 0.90, test specificity = 0.90)]: 

Posttest probability ~ . 

(0.50X0.90) + (0.50X0.10) 

. -0.90 ..... , .... 



positive test, which is the ratio of the taxe-positiye rate to the false- 
positive rate [or sensitivity/(l -r specificity)]. For example* a test with 
a sensitivity of 0,90 and a specificity of 0.90 has a likelihood ratio of 
0.90/0 -0.90), or 9. Thus, for this hypothetical test, a "positive" 




Posttesf 
probability 



Likelihood ratio; 
- • Sensitivity 
■ 1 - Specificity 



Pretest' 
probability' 



FIGURE 3-1 tomogram version of Bayes* theorem used to predict the post- 
test probability of disease (left-hand scale) using the pretest probability of dis- 
ease (right-hand scale) arid the likelihood ratio for a positive test (middle scale). 
The likelihood ratio is calculated as the sensitivity/(l — specificity). To use, 
place a straight edge rohnectirig the pretest probability and the iikemiood ratio, 
and read off the posttest probability: This figure illustrates the value of a pos- 
itive exercise treadmill test (likelihood ratio 4) and a positive exercise thallium 
SEECT. study (likelihood ratio 9) in the patient with a pretest probability of 
coronary artery disease of 50%. Treadmill results shown in solid line; thallium 
results in dashed line. ( Adapted from Fagan 77; N EnglJ Med 293:257, 1975.) 



result is 9 times more likely in a patient with the disease titan in a 
patient without it. The more accurate the test, the higher the livelihood 
ratio. However, if sensitivity is excellent but specificity is less so, the 
likelihood ratio will be substantially reduced (e.g.,' with a 90% sensi- 
tivity but a 60% specificity, the likelihood ratio is 2.25), Most tests in 
medicine have likelihood ratios for a. positive result between 1.5 
and 20. 

Consider two tests commonly used in the diagnosis of CAD, an 
exercise treadmill and an exercise thallium -201 single photon emission 
CT (SPECT) test (Chap. 244). Meta-analysis has shown the treadmill 
to have an average sensitivity of 66% and aii . average , specificity of 
84%, -yielding a likelihood ratio of 4.1 [0.66/(1 - 0.84)]. If we use 
this test on a patient with a pretest probability of CAD of 10%, the 
posttest probability of disease following a positive result rises only to 
about 30%. If a patient with a pretest probability of CAD of 80% has 
a positive test result, the posttest probability of disease is about 95%. 

The exercise thallium SPECT test is a more accurate test for the 
diagnosis of CAD. For our purposes^ assume that it has both a sensi- 
tivity and specificity of 90%, yielding a likelihood ratio of -9.0 [0.90/ 
{1 - 0.90)]. If we again test our low pretest probability patient and he 
has a positive test, using Fig. 3-1 we can demonstrate that the posttest 
probability of CAD rises from 10 to 50%. However, from a decision- 
making point of view, the more accurate test has not been able to 
improve diagnostic confidence enough to change management. In fact, 
the test has moved us from being fairly certain that the patient did not 
have CAD to being completely undecided {a 50:50 chance of disease), 
hi a patient with a pretest probability of 80%, using the more accurate 
thallium SPECT test raises the posttest probability to 97% (compared 
with 95% for the exercise treadmill). Again, the more accurate test 
does not provide enough improvement in posttest confidence to alter 
management, and neither test has improved much upon what was 
known from clinical data alone. 

If the pretest probability is low (e.g., ^20%), even a positive result 
on a very accurate test will not move the posttest probability to a range 
high enough to rule in disease (e.g., >80%). Conversely, with a high 
pTetest probability, a negative test will not adequately rule out disease. 
Thus, the largest gain in diagnostic confidence from a test occurs when 
the clinician is most uncertain before performing it (e.g., pretest prob- 
ability between 30 and 70%). For example, if a patient has a pretest 
probability for CAD of 50%, a positive exercise treadmill test will 
move the posttest probability. to 80% and a positive exercise thallium 
SPECT test . will move it to . 90% (Fig. 3-1). 

B ayes' theorem, as presented above, employs a number of impor- 
tant simplifications that should be considered. First, few tests have 
only two useful outcomes, positive or negative, and many tests provide 
numerous pieces of data about the patient. Even if these can be inte- 
grated into a summary result, multiple levels of useful information 
may be present (e.g.; strongly positive, positive, indeterminate* nega- 
tive, strongly negative). While B ayes' theorem can be adapted to this 
more detailed test result format, it is computationally complex to do 
so. Second, Bayes' theorem assumes that the information from the test 
is completely unique and nonoverlapping with information used to 
estimate the pretest probability. This independence assumption, how- 
ever, is often wrong. In many cases, test results are correlated with 
patient characteristics. For example, the findings of cardiomegaly and 
pulmonary, edema on chest x-ray are correlated with the. historic fea- 
tures of heart failure and with the physical findings of a displaced left 
ventricular apical impulse, an S 3 gallop, and rales. .The unique predic- 
tive information contributed by the test in this case (the chest x-ray) 
is only a fraction of its total information because: much had already 
been learned about the probability of heart failure before the test was 
done. 

Finally , it has long been thought that sensitivity and specificity are 
prevalence-independent parameters of test accuracy, and many texts 
still make this assertion. This statistically useful assumption, however, 
is clinically wrong. For example, a treadmill exercise test has a sen- 
sitivity in a population of patients with one-vesselCAD of. around 
30%, whereas the sensitivity in severe three-vessel CAD approaches 
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each strategy, there is a risk that the patient will deyelop endocarditis 
with or without major complications. In this analysis, the longest qual- 
ity-adjusted survival (5.47 quality-adjusted life-years) was associated 
with the 4-week antibiotic course strategy, which also had the highest 
costs ($14,136 per patient), whereas the lowest costs ($9830 per pa- 
tient) and worst outcomes (5.42 quality-adjusted Ufe-years) were as- 
sociated with the 2-week antibiotic course strategy. From a clinical 
point of view (ignoring costs), the 4-week antibiotic course was best. 
From a cost-effectiveness point of view; the TEE strategy (5.46 qual- 
ity-adjusted liferyears and $10,05.1 per patient costs) provided the best 
balance of added benefits and costs. Jhus, decision analysis can be 
extremely helpful in clarifying tradeoffs in outcomes and costs in dif- 
ficult management areas such as the. above. where it is highly unlikely 
that an adequate randomized trial will ever be done. 

The data needed to t fill in . a decision tree (Fig. 3-2) are typically 
cobbled together from a variety of sources, including the literature 
(randomized trials, meta-analyses, observational studies) and expert 
opinion. Once the decision tree is finished, the decision is ''analyzed*' 
by calculating the average value of each limb of the nee. The decision 
arm with the highest net value (or expected utility) is (he preferred 
choice! The value of this exercise, however, is not so much in develr 
oping a prescription for action as it is in exploring the key elements 
and pressure points of a complex or difficult decision. The process of . 
building, the decision tree forces the analyst to be explicit about the 
choices being considered and all their relevant outcomes. Areas of high 
uncertainty are readily identified. Sensitivity analyses are an integral 
part of decision analysis and involve systematically varying the value 
of each key parameter in the model alone (one-way sensitivity anal- 
ysis,) in pairs (two-way), or in higher combinations (multivariate) to 
assess the impact on choice of preferred management strategy. In the 
above example, varying the incidence of endocarditis resulting from. 
S. aureus. bacteremia from 3% to over 50% had no impact. on. the 
choice of TEE as the preferred strategy. 

User -friendly personal computer-based software packages now 
make the creation and analysis of decision trees much more straight- 
forward than in the past. However, the process is still too cumbersome 
and time-constmiing to be used on a routine basis. When medicine is 
practiced from a fully computerized platform, a library of prestructured 
decision trees with user modifiable values can be made available to 
support practitioners working, with individual patients. • 

conclusions' 
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In this era of evidence-based medicine, it is tempting to think that all 
the difficult decisions practitioners face have been or soon will be 
solved and digested into practice guidelines and computerized re- 
minders. For the foreseeable future; however, such is not the case. 
Meta-analyses cannot generate evidence where there are no adequate 
randomized trials, and most of what clinicians face will never be thor- 
oughly tested in a randomized trial. Excellent clinical reasoning skills 
and experience supplemented by well-designed quantitative tools and 
a keen appreciation for individual patient preferences will continue to 
be of paramount importance in the professional life of medical prac- 
titioners for years' to come, " : 
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The United States has the distinction of having some of the best med- 
ical care of any technologically advanced country: We have many of 
the best hospitals and doctors in the world. The research pipeline is 
full of significant new therapeutic advances, with revolutionary ge- 
netic-based therapies perhaps only a decade away. Our citizens largely 
subscribe to the principle that excellent medical -care should be avail- 
able to all, regardless of ability to pay. Yet we also have over 43 
million people (most of them employed and earning minimal wages) 
without any health insurance and many more who are inadequately 
insured. Since the collapse of the Clinton health care reform efforts in 
1994, U.S. health policy has been directed by marketplace forces that 
have created powerful and sometimes perverse incentives in medicine: 
Health insurance companies that use every available means to avoid 
insuring sick people; "managed care" programs that really only man- 
age costs; doctors who are provided incentives to provide less medical 
care; and'pharmaceutical companies that develop powerful and expen- 
sive new drugs priced beyond the reach of many of the elderly and 
chronically ill who need them most. 

Facing such powerful and chaotic forces, physicians tend to focus 
narrowly on what they are most comfortable with, taking care of in- 
dividual patients audi conducting academic investigations. Many doc- 
tors consider economics too arcane for them to grasp and therefore do 
not even try. Consequently, when presented with economic arguments 
and evidence they are often unable to discriminate the legitimate from 
the fallacious. More importantly, they are ill equipped to defend their 
patients' interests in the crucible of cost containment that characterizes 
the modem managed care 1 era. 

This chapter has two goals: first, to provide a brief introduction to 
some of the larger economic forces that shape modem medical prac- 
tices, and second, to* introduce the economic tools that are used for 
assessing the value of medical practices, including cost effectiveness 
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HEALTH CARE SPENDING AND FINANCING 

HOW MUCH IS SPENT ON HEALTH CARE? In 1997, 
the United States spent $1.1 trillion on its health, care system, repre- 
senting 13.5% of the gross domestic product (GDP) (a crude measure 
of national income). Most of this ($969 billion) was spent on personal 
health care: 34% went to hospitals, 20% to physicians, 7% to nursing 
homes, and 8% to outpatient pharmaceuticals. In comparison, Canada 
and Western- European countries-spend a substantially smaller portion 
(6. to 10%) of their national income on health care but their citizens 
appear to be equally healthy, at least by crude metrics such as life 
expectancy and infant mortality rates. Economists.and politicians have 
for years used such data to ; argue that the United States spends too 
much on health care. The issue of how much to spendis an inherently 
political one, however, and the discipline of economics has. little to 
say about it. . 

WHO PAYS FOR HEALTH CARE? Two major factors 
are: continually driving up the costs of medical care: introduction into 
medical practice of new medical technologies (drugs, devices, proce- 
dures) that have a highprice tag, and the aging of the .U.S.. population 
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sincere concern, the willingness to take the time to explain all aspects 
of the illness, and a nonjudgmental attitude when dealing with patients 
whose cultures, lifestyles, attitudes, and values differ from those of 
the physician are just a few of the characteristics of the humane phy- 
sician. Every physician will, at times, be challenged by patients who 
evoke strongly negative (or strongly positive) , emotional responses. 
Physicians should be alert to their own reactions to such patients and 
situations and should consciously monitor and control their behavior 
so that the patients* best interests. remain the principal; motivation for 
their actions at all times, 

. : An important aspect of patient care involves an appreciation of the 
"quality of life," a subjective assessment of what each patient values 
most Such an assessment requires detailed, sometimes intimate 
knowledge of the patient, wfcch can usually be obtained only through 
deliberate, unhurried, and often repeated conversations. It is in these 
situations that the time constraints of a managed care setting 'may prove 
problematic. 

The famous statement of Dr. Francis Peabody is even more rele- 
vant today than when delivered more man three-quarters of a century ago 
The significance of the intimate personal relationship between phy- 
sician and patient cannot he too strongly emphasized, for in an 
extraordinarily large number of cases both the diagnosis and treat- 
ment are directly, dependent on it One of the essential qualities of 
the clinician is interest in humanity, for the secret of the care of 
the patient ism caring for the patient. 

CLINICAL SKILLS History Taking The written history of 
an illness should embody all the facts of medical significance in the 
life of the patient Recent events should be given the most attention 
The patient should, at some point, have the opportunity to tell his or 
her own story of ihe illness without frequent interrupUon and, when 
appropriate, receive expressions of interest encouragement, and em- 
pathy from the physician^ The physician must be alert to the possibility 
that any event related by the patient, however trivial or apparently 
remote, may be the key to the solution of the medical problem. 

An informative history is more than an orderly listing of symp- 
tom's; something is always gained by listening to patients and noting 
the way in which they describe their symptoms Inflections of voice, 
facial expression, gestures, and attitude may reveal important clues to 
the meaning of me symptoms to the patient iTaking history often in- 
volves much data gathering. Patients vary ui their medical sophisti- 
cation and ability to recall facts; Medical history should therefore be 
corroborated whenever possible. The family and social history can also 
provide important insights into the types of diseased that should be 
considered. In listening to the history, the physiciart discovers hoi only 
something about the disease but also something about the patient. The 
process of history taking provides an opportunity to observe the pa- 
tient's behavior and to watch for features to' be' pursued more thor- 
ougWy during the physical examination: 

The very act of eliciting the history provides trie 'physician with 
the opportunity to establish or enhance the unique bond that is the 
basis for the ideal patient-physician relationship; It is helpful to de- 
velop an appreciation of the patient's perception of the illness, the 
patient's expectations of the physician and the medical care system, 
and the financial and social implications of the illness to -the patient! 
The confidentiality of the patient-physician relationship should be em- 
phasized, and the patient should be given the opportunity to identify 
any aspects of the history that should not be disclosed, 

n Physical Examination Physical signs are objective indications 
of disease whose significance is enhanced when they confirm a func- 
tional or structural change already suggested by the patient's history. 
At times, however, the physical signs may. be the only evidence of 
disease. ..... 

The physical examination should be performed methodically and 
thoroughly, with consideration for the patient's comfort and modesty. 
Although attention is often directed by me history to the diseased organ 



or part of the body, the examination of a new patient must extend from 
head to toe in an objective search for abnormalities. Unless the phys- 
ical examination is systematic, important segments may be omitted. 
The results of the examination, like the details of the history, should 
be recorded at the time they are elicited, not hours later when they are 
subject to the distortions of memory. Skill in physical diagnosis is 
acquired with experience, but it is not merely technique that deter- 
mines success in eliciting signs. The detection of a few scattered pe- 
techiae, a faint diastolic murmur, or a small mass in the abdomen is 
not a question of keener eyes and ears or more sensitive fingers but of 
a mind alert to these findings. Since physical findings are subject to 
changes, the physical examination should be repeated as frequently as 
the clinical situation warrants. 

Laboratory Tests The availability of a wide array of laboratory 
tests has increased our reliance on these studies for the solution of 
clinical problems. The accumulation of laboratory oata does not.relieve 
the physician from the responsibility of careful observation, exami- 
nation, and study of the patient It is also 'essential to bear ui. mind the 
limitations of such tests. By virtue of their impersonal quality, com- 
plexity, and apparent' precision, they often gain an aura of authority 
regardless of the fallibility of the tests themselves, the instruments used 
in the tests, and the individuals performing or interpreting ! mem. Phy- 
sicians must weigh the expense involved^ the laboratory ^procedures 
they order relative to the value of the information they are likely to 
provide. 1 ; * ' j , 

Single laboratory tests are rarely ordered Rather, they are gener- 
ally obtained as "batteries" of multiple tests, which are often useful. 
For example, abnormalities of hepatic function may provide the clue 
to such nonspecific symptoms as generalized wealmess arid increased 
fatigability, suggesting the diagnosis of chronic liver disease, Some- 
times a single abnormality , : such as an elevated senmi calciuTn level, 
points to particular diseases, such as hyrxrparatJiyroidisiri blunder-' 
lying malignancy. " .«•.••../... 

The thoughtful use of screening tests shoukf not be confused with 
mchscriminate laboratory testing. The use of scree'rung tests is based 
oh the fact that a group of laboratory determinatibns can "be carried 
out conveniently on a single specimen of blooa 1 at relatively low cost. 
Screening tests are most useful when they are directed towards com- 
mon diseases or disorders in which the result directs other useful tests 
or interventions that would otherwise be costly to perform. Biochem- 
ical measurements, together with simple laboratory exarninations such 
as blood count, urinalysis, and sedimentation rate, often provide the 
major clue to the presence of a pathologic process. At the same time, 
the physician must learn to evaluate occasional abnormalities among 
the screening tests that may not necessarily connote significant disease. 
An in-depth workup following a report of an isolated laboratory ab- 
normality in a person who is otherwise well is almost invariably waste- 
ful and unproductive. Among the more than 40 tests that are routinely 
performed on patients, one or two are often slightly abnormal: If there 
is no suspicion of an underlying illness, these tests are ordinarily re- 
peated to ensure that the abnormality, does not represent a laboratory 
error. If an abnormality is confirmed, it is important to consider its 
potential significance in the context of the patient 's condition and other 
test results. :■ 

Imaging Techniques The availability of ultrasonography* a 'va- 
riety of scans that employ isotopes to visualize organs heretofore in- 
accessible, computed tomography, and magnetic resonance imaging 
has opened new diagnostic vistas and has benefited patients because 
these new techniques have largely supplanted more invasive ones. 
While the enthusiasm for' noninvasive technology is understandable, 
the expense entailed in performing these tests is often substantial and* 
should be considered when assessing, the potential benefits of the in- 
formation provided. ( ( 
PRINCIPLES OF PATIENT CARE Medical Decision- 
Making Both during and in particular after the physician has taken 
the history, performed the physical examination, and reviewed the lab- 
oratory and imaging data, the challenging,precess of the differential 
diagnosis and medical decision-making begins. Formulating a differ- 
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Figure A.23. Peripheral blood mononuclear cells can be isolated 
from whole blood by Ficoll-Hypaque™ centrifugation. Diluted 
anticoagulated blood (left panel) is layered over Ficoll-Hypaque™ and 
centrifuged. Red blood cells and polymorphonuclear leukocytes or 
granulocytes are more dense and centrifuge through the Ficoll- 
Hypaque™, while mononuclear cells consisting of lymphocytes 
together with some monocytes band over it and can be recovered at 
the interface (right panel). 
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Test accuracy studies assume the existence of a well-defined illness definition and 
clear-cut diagnostic gold standards or reference standards. However, in clinical 
reality illness definitions may be vague or a mere description of a set of 
manifestations, mostly clinical signs and symptoms. This can lead to 
disagreements among experts about the correct classification of an illness and the 
adequate reference standard. Using data from a diagnostic accuracy study in 
carpal tunnel syndrome, we explored the impact of different definitions on the 
estimated test accuracy and found that estimated lest performance characteristics 
varied considerably depending on the chosen reference standard. In situations 
without a dear-cut illness definition, randomized controlled trials may be 
preferable to test accuracy studies for the evaluation of a novel test. These studies 
do not determine the diagnostic accuracy, but lite clinical impact of a novel lest 
on patient management and outcome. 

Keywords Sensitivity and specificity, ROC curve, reference standards, carpal tunnel 
syndrome, uh rasonograph y 



The notion of a diagnostic gold standard or reference standard 
pertains to the hest available method for establishing the presence 
or absence ol a condition of interest, 1 i.e. the independent and 
correct classification of what is meant lo be the illness. 2 The 
traditional concept of a reference standard depends on a high 
level of biological understanding of the target condition and its 
causal underlying mechanisms. Typically, a morphological 
verification such as hisiopathology or angiography, is used to 
establish a 'definite diagnosis'. This definite diagnosis is assumed 
to be a reasonably reliable proxy measure of the true presence or 
absence of the condition of interest. 

In conventional diagnostic accuracy studies, ihe usefulness of 
a novel test for the inclusion or exclusion of a specific condition 
will be determined by comparing the results of the test with the 
definite diagnosis ascertained by the reference standard. 
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However, in clinical reality the biological understanding of 
conditions is frequently unclear. Illness definitions are vague 
or a mere description of a set of manifestations. In fields such 
as psychiatry and rheumatology, clinicians frequently use 
'syndromal diagnoses' consisting or a characteristic pattern of 
signs and symptoms, 3 while ihe biological understanding of the 
condition, of its causes, and its manifestations is incomplete and 
there is controversy about ihe manifestations that have lo be 
combined to ensure accurate representation of the condition. Tn 
other situations, the biological understanding ol the condition 
may be comprehensive, but the measurement of signs or 
symptoms is inaccurate. 

Two extreme conceptualizations of the reference standard 
may implicitly or explicitly be used in such circumsiances. One 
extreme ignores potential controversies and assumes a well- 
defined illness, which is objectively and reproducibly repre- 
sented by ihe outcome of one or several laboratory tests. The 
other extreme ignores potentially useful biological measures and 
focuses exclusively on patieni outcomes or on the need for an 
intervention. While these two outlooks aim at describing 
the same issue, they may create a schism when evaluating a 
diagnostic test. Below, we will explore this in a clinical example 
of an accuracy study previously published by our group in the 
field ol rheumatology' 1 and discuss the potential implications 
for clinical research into conditions without a clear-cut reference 
standard by which lo establish a diagnosis. 



953 



954 INTERNATIONAL JOURNAL OF EPIDEMIOLOGY 



Clinical example 

Carpal Tunnel Syndrome (CTS) is an important cause of 
functional impairment and pain of die hand, which presumably 
results from a compression of die median nerve at die wrist. 
Unfortunately, there is no universally accepted reference 
standard to establish the diagnosis. In our experience, iwo 
different approaches towards CTS classification are used. 
Neurologists traditionally establish the definite diagnosis based 
more on the outcome of nerve conduction studies than on 
die patients' signs and symptoms. In contrast, hand surgeons 
appear to give considerably more importance to the patients' 
signs and symptoms, the: severity of complaints and the likely 
need for and success of a surgical intervention than to nerve 
conduction studies when establishing the definite diagnosis. In 
our accuracy study,' 1 we relied on current practice and pre- 
specified the neurologists' definite diagnosis as the reference 
standard. Mere, we determine the impact ol using either ol the 
two 'reference standards' on the estimated test accuracy of 
sonography in patients with suspected CTS. 

Methods and results 

Details of methods are reported else wh ere. '* We assessed 77 
patients for eligibility, excluded 3 because of traumatic wrist 
lesions, and enrolled 74 referred to the outpatient clinic of the 
Depart mem of Hand Surgery at the University Hospital Berne. 
Switzerland, between January and December 2002. 

Patients included in the study had a mean age of 51 years 
and 48 were females (65%). The How of patients through the 
various stages of the study is described elsewhere. 4 Essentially, 
101 wrists from 71 patients were included in the analysis. 

Standardized nerve conduction studies were performed by 
one of several neurologists, who were unaware of the results 
of the sonographic examination. The sonographic evaluations 
were performed by a rheumatologist experienced in muscu- 
loskeletal sonography, who was unaware of the results of the 
nerve conduction studies and of the patients' signs and 
symptoms. He performed transverse imaging of the median 
nerve for the area ranging from the distal forearm to the outlet 
of the carpal tunnel and measured the largest cross-sectional 
area of the median nerve in square millimetres. We used this 
measure as a single diagnostic indicator, assuming that an 
increase in cross-sectional areas is associated with an increasing 
likelihood of disease or disease severity. 

Table 1 presents a comparison of definite diagnoses accord- 
ing to neurologists' and hand surgeons' judgements. Overall 
agreement was 86%. One out of 23 wrists classified as normal 
by the neurologists was considered as CTS by the hand surgeons 
(4%). This wrist had normal nerve conduction studies. 

Table 1 2X2 contingency table comparing reference standard 
classifications according to neurologists and hand surgeons 



Hand surgeons' Judgements 
CTS prcscn t CTS a hsen t Total 



Neurologists' judgements 








CTS present 


65 


13 


78 


CTS absent 


1 


22 


23 


Total 


66 


35 


101 



Conversely, 13 out of 78 wrists classified as CTS by the 
neurologists were considered normal by the hand surgeons 
(17%); all 13 wrists had pathological nerve conduction studies. 
The resulting kappa for the agreement between the two illness 
definitions was 0.67 [95% confidence interval (CI) 0.48-0.85). 

For both reference standards, we filled a receiver operating 
characteristic (KOC) curve for diagnosis of CTS by sonography, 
using a maximum likelihood logistic regression model based on 
robust standard errors, which allowed for the correlation of 
characteristics of wrists within patients and compared the area 
under the ROC curve. Figure I shows the filled HOC curves 
using either the neurologists' judgements (top) or the hand 
surgeons' judgements (bottom) as the reference standard. The 
area under the ROC curve lor ultrasound was 0.89 based on 
neurologists' judgements (95% CI 0.82-0.96) and 0.77 based on 
hand surgeons' judgements (95% CI 0.68-0.87). The difference 
between the two areas under the ROC curve was 0.12 (95% 
CI 0.0-0.23). 




False positive rate (1 - specificity) 

Figure 1 Fiueti ROC curves (solid curve) for diagnosis of CTS by 
sonography with 95% confidence interval (doucd curves), considering 
the neurologists' de-Unite diagnosis (top) or the hand surgeons' definite 
diagnosis as the reference standard (bottom). The broken diagonal tine 
represents a hypothetical HOC curve of a test thai yields no diagnostic 
information 
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Discussion 

Even though the agreement between the two employed illness 
definitions was subsuvntial (a kappa of 0.67), the estimated lest 
performance of ultrasound varied considerably depending on 
the definition used as the reference standard. The diagnostic 
accuracy of sonography in patients with suspected CTS was 
good to excellent according to one reference standard hut only 
moderate according to the other. 

The lack of consensus on an illness definition may impede 
a valid evaluation of diagnostic technology in test accuracy 
studies. Considering thai the final purpose of any novel test is 
to improve patient management and outcome, the traditional 
paradigm of test accuracy studies will only be useful if a 
reference standard is chosen that either has a strong associa- 
tion with patient outcome or a direct relationship with patient 
management. In our accuracy study 4 we argued, for example, 
that the neurologists' definite diagnosis directly pertains to 
clinical decision making and patient management. 

Ultimately, the use of a diagnostic test and its potential 
therapeutic consequences can be considered as two consecu- 
tive steps of the same management strategy. Analogous to 
traditional research into therapeutic interventions, randomized 
trials may be designed to compare different strategies. In such 
trials, patients will be randomly allocated to a management 



strategy that includes the use of a novel test wmWr evaluation, or 
10 a strategy that uses standard tests only. Ascertained outcomes 
may relate to parameters of patient management (e.g. length ol 
hospital stay), to patient outcome (e.g. pain), or to the total cost of 
management per patient* If an unanimously accepted reference 
standard is lacking, as is the case in CTS, such randomized 
controlled trials may be more appropriate than test accuracy 
studies to determine the usefulness of a novel diagnostic test. 
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